# This Page Is Inserted by IFW Operations and is not a part of the Official Record

# BEST AVAILABLE IMAGES

Defective images within this document are accurate representations of the original documents submitted by the applicant.

Defects in the images may include (but are not limited to):

- BLACK BORDERS
- TEXT CUT OFF AT TOP, BOTTOM OR SIDES
- FADED TEXT
- ILLEGIBLE TEXT
- SKEWED/SLANTED IMAGES
- COLORED PHOTOS
- BLACK OR VERY BLACK AND WHITE DARK PHOTOS
- GRAY SCALE DOCUMENTS

## IMAGES ARE BEST AVAILABLE COPY.

As rescanning documents will not correct images, please do not report the images to the Image Problem Mailbox.

US Patent & Trademark Office

Subscribe (Full Service) Register (Limited Service, Free) Login

Search: 

The ACM Digital Library 
The Guide

(coherency or coherence) and directory and (owner or owners)

# EACH DICHAL LIBRAR

Feedback Report a problem Satisfaction survey

Terms used coherency or coherence and directory and owner or ownership and tag

Found 2,272 of 138,517

Sort results

relevance

Save results to a Binder Search Tips

Try an Advanced Search Try this search in The ACM Guide

Display results

expanded form

Open results in a new window

Results 1 - 20 of 200

Result page: 1 2 3 4 5 6 7 8 9 10

Relevance scale

Best 200 shown

Adjustable block size coherent caches

Czarek Dubnicki, Thomas J. LeBlanc

April 1992 ACM SIGARCH Computer Architecture News, Proceedings of the 19th annual

Full text available: pdf(1.24 MB)

international symposium on Computer architecture, Volume 20 Issue 2 Additional Information: full citation, abstract, references, citings, index terms

Several studies have shown that the performance of coherent caches depends on the relationship between the granularity of sharing and locality exhibited by the program and the cache block size. Large cache blocks exploit processor and spatial locality, but may cause unnecessary cache invalidations due to false sharing. Small cache blocks can reduce the number of cache invalidations, but increase the nuber of bus or network transactions required to load data into the cache. In this paper we ...

<sup>2</sup> Using destination-set prediction to improve the latency/bandwidth tradeoff in shared-memory multiprocessors

Milo M. K. Martin, Pacia J. Harper, Daniel J. Sorin, Mark D. Hill, David A. Wood May 2003 ACM SIGARCH Computer Architecture News, Proceedings of the 30th annual

international symposium on Computer architecture, Volume 31 Issue 2 Full text available: pdf(220,76 KB) Additional Information: full citation, abstract, references

Destination-set prediction can improve the latency/bandwidth tradeoff in shared-memory multiprocessors. The destination set is the collection of processors that receive a particular coherence request. Snooping protocols send requests to the maximal destination set (i.e., all processors), reducing latency for cache-to-cache misses at the expense of increased traffic. Directory protocols send requests to the minimal destination set, reducing bandwidth at the expense of an indirection through the d ...

<sup>3</sup> Piranha: a scalable <u>architecture based on single-chip multiprocessing</u> Luiz André Barroso, Kourosh Gharachorloo, Robert McNamara, Andreas Nowatzyk, Shaz Qadeer, Barton Sano, Scott Smith, Robert Stets, Ben Verghese May 2000

ACM SIGARCH Computer Architecture News, Proceedings of the 27th annual international symposium on Computer architecture, Volume 28 Issue 2

Full text available: pdf(191.10 KB)

Additional Information: full citation, abstract, references, citings, index terms

The microprocessor industry is currently struggling with higher development costs and longer design times that arise from exceedingly complex processors that are pushing the limits of instruction-level parallelism. Meanwhile, such designs are especially ill suited for important commercial applications, such as on-line transaction processing (OLTP), which suffer from large memory stall times and exhibit little instruction-level parallelism. Given that commercial applications constitute by fa ...

Memory coherence in shared virtual memory systems Kai Li, Paul Hudak

November 1989 ACM Transactions on Computer Systems (TOCS), Volume 7 Issue 4

Full text available: pdf(2.71 MB)

Additional Information: full citation, abstract, references, citings, index terms, review

The memory coherence problem in designing and implementing a shared virtual memory on loosely coupled multiprocessors is studied in depth. Two classes of algorithms, centralized and distributed. for solving the problem are presented. A prototype shared virtual memory on an Apollo ring based on these algorithms has been implemented. Both theoretical and practical results show that the memory coherence problem can indeed be solved efficiently on a loosely coupled multiprocessor.

| 5  | A distributed shared memory multiprocessor ASURA: memory and cache architecture S. Mori, H. Saito, M. Goshima, S. Tomita, M. Yanagihara, T. Tanaka, D. Fraser, K. Joe, H. Nitta December 1993 Proceedings of the 1993 ACM/IEEE conference on Supercomputing                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |  |  |  |  |  |  |
|----|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|--|--|--|
|    | Full text available: pdf(1.17 MB)  Additional Information: full citation, references, citings, index terms                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |  |  |  |  |  |  |
| 6  | Synchronization with multiprocessor caches  Joonwon Lee, Umakishore Ramachandran  May 1990 ACM SIGARCH Computer Architecture News , Proceedings of the 17th annual international symposium on Computer Architecture, Volume 18 Issue 3  Full text available: Odf(1.18 MB) Additional Information: full citation, abstract, references, citings, index terms                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |  |  |  |  |  |  |
|    | Introducing private caches in bus-based shared memory multiprocessors leads to the cache consistency problem since there may be multiple copies of shared data. However, the ability to snoop on the bus coupled with the fast broadcast capability allows the design of special hardware support for synchronization. We present a new lock-based cache scheme which incorporates synchronization into the cache coherency mechanism. With this scheme high-level synchronization primitives as well as low-le                                                                                                                                                                                                                                                                                                                            |  |  |  |  |  |  |
| 7  | Delayed consistency and its effects on the miss rate of parallel programs  Michel Dubois, Jin Chin Wang, Luiz A. Barroso, Kangwoo Lee, Yung-Syau Chen  August 1991 Proceedings of the 1991 ACM/IEEE conference on Supercomputing                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |  |  |  |  |  |  |
|    | Full text available: pdf(1.01 MB)  Additional Information: full citation, references, citings, index terms                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |  |  |  |  |  |  |
| 8  | Cache memory performance in a unix enviroment Cedell Alexander, William Keshlear, Furrokh Cooper, Faye Briggs June 1986 ACM SIGARCH Computer Architecture News, Volume 14 Issue 3 Full text available: pdf(2.10 MB)  Additional Information: full citation, citings, index terms                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |  |  |  |  |  |  |
| 9  | An economical solution to the cache coherence problem  James Archibald, Jean Loup Baer  January 1984 ACM SIGARCH Computer Architecture News, Proceedings of the 11th annual international symposium on Computer architecture, Volume 12 Issue 3  Full text available: pdf(728.73 KB)  Additional Information: full citation, abstract, references, citings, index terms  In this paper we review and qualitatively evaluate schemes to maintain cache coherence in tightly-coupled multiprocessor systems. This leads us to propose a more economical (hardware-wise), expandable and modular variation of the "global directory" approach. Protocols for this solution are described. Performance evaluation studies indicate the limits (number of processors, level of sharing) within which this approach is viable.                   |  |  |  |  |  |  |
| 10 | Cache coherence protocols: evaluation using a multiprocessor simulation model  James Archibald, Jean-Loup Baer September 1986 ACM Transactions on Computer Systems (TOCS), Volume 4 Issue 4  Full text available: pdf(1.79 MB)  Additional Information: full citation, abstract, references, citings, index terms, review  Using simulation, we examine the efficiency of several distributed, hardware-based solutions to the cache coherence problem in shared-bus multiprocessors. For each of the approaches, the associated protocol is outlined. The simulation model is described, and results from that model are presented. The magnitude of the potential performance difference between the various approaches indicates that the choice of coherence solution is very important in the design of an efficient shared-bus multi |  |  |  |  |  |  |
| 11 | Cache coherence in systems with parallel communication channels & many processors  John C. Willis, Arthur C. Sanderson, Charles R. Hill                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |  |  |  |  |  |  |

systems with parallel communication channels and many processors. A distributed link-list relates all cache frames representing the same main memory block. Messages traverse the list to maintain list integrity, exclusive ownership, and consistent values. Memory access semantics are equivalent to a shared memory system without caches. Reference latency, efficiency of memory use, and hardware complex ...

This paper describes and analyzes two algorithms for maintaining cache coherence in multiprocessor

Additional Information: full citation, abstract, references

November 1990 Proceedings of the 1990 ACM/IEEE conference on Supercomputing

Full text available: pdf(868.59 KB)

| 42                                                               |                                                                                           |                                                                         |                                                                                       |                                                                                                                                          | _ |
|------------------------------------------------------------------|-------------------------------------------------------------------------------------------|-------------------------------------------------------------------------|---------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------|---|
|                                                                  | ed caching techni<br>. A. Goosen, P. D.                                                   | ques for scalability                                                    | n VMP-M/C                                                                             |                                                                                                                                          |   |
| April 1989 ACM                                                   | SIGARCH Compu                                                                             | ter Architecture Ne                                                     |                                                                                       |                                                                                                                                          |   |
| Full text available:                                             |                                                                                           | um on Computer ar<br>Additional Information: <u>full cit</u>            | ·                                                                                     |                                                                                                                                          |   |
| a scalable men<br>system. In this<br>on the VMP mu               | nory hierarchy, ass<br>s paper, we describ<br>ultiprocessor design<br>none to several tho | suming interprocessor<br>be the VMP-MC design<br>n, that is intended to | communication is h<br>, a distributed parall<br>provide a set of build                | reduced to that of building<br>landled by the memory<br>del multi-computer based<br>ding blocks for configuring<br>ry hierarchy based on |   |
| 13 Options for dyna                                              |                                                                                           | slation in COMAs                                                        |                                                                                       |                                                                                                                                          |   |
|                                                                  | SIGARCH Comput                                                                            | ter Architecture Nev<br>um on Computer are                              |                                                                                       |                                                                                                                                          |   |
|                                                                  |                                                                                           | Additional Information: full cita                                       |                                                                                       |                                                                                                                                          |   |
| In modern produced before or in pa pace and the withe TLB (Trans | rallel with the first<br>vorking sets of new<br>lation Lookaside Bi                       | -level cache access. A<br>applications grow ins                         | s processor technologatiably the latency as and more difficult                        | port virtual memory is done ogy improves at a rapid and bandwidth demands or to meet. The situation is gued by the TLB                   |   |
| P. Sweazey, A. J.<br>June 1986 ACM S                             | Smith<br>SIGARCH Comput                                                                   | ter Architecture Nev                                                    | vs , Proceedings o                                                                    | the IEEE futurebus f the 13th annual                                                                                                     |   |
| intern<br>Full text available: po                                |                                                                                           | um on Computer are<br>Additional Information: <u>full cita</u>          |                                                                                       |                                                                                                                                          |   |
| by different ver<br>define a class o<br>We refer to this         | ndors, implies the loft compatible consing class as the MOE                               | need for a standardize<br>stency protocols supp                         | ed cache consistency<br>orted by the current<br>the term "MOESI" is                   | nmodate boards developed<br>protocol. In this paper we<br>I IEEE Futurebus design.<br>derived from the names o<br>nent ca                | : |
|                                                                  | e, Yozo Nakayama<br>edings of the 200                                                     | , Toshiya Mima                                                          | sia South Pacific d                                                                   | esign automation/VLSI                                                                                                                    |   |
| Full text available:                                             | II(141.86 KB)                                                                             | Additional Information: full cita                                       | lion, abstract                                                                        |                                                                                                                                          |   |
| large cache-coh<br>and complexity<br>biased/constrai             | nerent, non-uniforr<br>of the design and<br>ned random stimu<br>d feedback with co        | n memory access (CC<br>the complexity of the<br>li generator coupled w  | <ul> <li>-NUMA) multi-proce<br/>cache-coherence profith an error detection</li> </ul> | usually quite ineffective on<br>ssors because of the size<br>rotocol. A controllable<br>on mechanism using<br>native methodology. We     |   |
| J. Chapin, M. Rose<br>December 1995 ACM                          | enblum, S. Devine,<br>SIGOPS Operation                                                    | l-memory multiproce<br>T. Lahiri, D. Teodosiu<br>ng Systems Review      | , A. Gupta<br>, Proceedings of t                                                      |                                                                                                                                          |   |
| symp<br>Full text available: pd                                  |                                                                                           | ting systems princi<br>Additional Information: <u>full cital</u>        | •                                                                                     |                                                                                                                                          |   |
| Manuel E. Acacio, .                                              | José González, Jos                                                                        | cache-to-cache trar<br>é M. García, José Dua<br>102 ACM/IEEE confe      | to                                                                                    | c-NUMA architecture                                                                                                                      |   |
| Full text available: pd                                          |                                                                                           | Additional Information: full citat                                      |                                                                                       | · -                                                                                                                                      |   |
| Cache misses fo                                                  | or which data must                                                                        | be obtained from a r                                                    | emote cache (cache                                                                    | -to-cache transfer misses)                                                                                                               |   |

account for an important fraction of the total miss rate. Unfortunately, cc-NUMA designs put the access to the directory information into the critical path of 3-hop misses, which significantly penalizes them compared to SMP designs. This work studies the use of owner prediction as a means of providing cc-NUMA multiprocessors with a more efficient support for cache-to-cache transfer misses. Our propo ...

<sup>18</sup> A cache coherence approach for large multiprocessor systems J. K. Archibald

Proceedings of the 2nd international conference on Supercomputing June 1988

Full text available: pdf(1.05 MB)

Additional Information: full citation, abstract, references, citings, index terms

This paper explores the architecture of high-performance large scale multiprocessors using private caches for each processor. The caches reduce the average memory access time, but they also result in the well known cache coherence problem. Multiple copies of each memory location are allowed to exist but they must be kept consistent with each other. In this paper, we present a solution to the cache coherence problem specifically for shared bus multiprocessors that adapts dyn ...

19 Token coherence: decoupling performance and correctness

Milo M. K. Martin, Mark D. Hill, David A. Wood

ACM SIGARCH Computer Architecture News, Proceedings of the 30th annual international symposium on Computer architecture, Volume 31 Issue 2

Full text available: pdf(269.08 KB)

Additional Information: full citation, abstract, references

Many future shared-memory multiprocessor servers will both target commercial workloads and use highly-integrated "glueless" designs. Implementing low-latency cache coherence in these systems is difficult, because traditional approaches either add indirection for common cache-to-cache misses (directory protocols) or require a totally-ordered interconnect (traditional snooping protocols). Unfortunately, totally-ordered interconnects are difficult to implement in glueless designs. An ideal coherenc ...

<sup>20</sup> Multicast snooping: a new coherence method using a multicast address network E. Ender Bilir, Ross M. Dickson, Ying Hu, Manoj Plakal, Daniel J. Sorin, Mark D. Hill, David A. Wood ACM SIGARCH Computer Architecture News, Proceedings of the 26th annual international symposium on Computer architecture, Volume 27 Issue 2

Full text available

pdf(98.12 KB) Publisher Additional Information: full citation, abstract, references, citings, index terms

This paper proposes a new coherence method called "multicast snooping" that dynamically adapts between broadcast snooping and a directory protocol. Multicast snooping is unique because processors predict which caches should snoop each coherence transaction by specifying a multicast "mask." Transactions are delivered with an ordered multicast network, such as an Isotach network, which eliminates the need for acknowledgment messages. Processors handle transactions as they would with a snoop ...

Results 1 - 20 of 200

Result page: 1 2 3 4 5 6 7 8 9 10 next

The ACM Portal is published by the Association for Computing Machinery. Copyright @ 2004 ACM, Inc. Terms of Usage Privacy Policy Code of Ethics Contact Us

Useful downloads: Adobe Acrobat QuickTime Windows Media Player

Subscribe (Full Service) Register (Limited Service, Free) Login

Search: 
The ACM Digital Library The Guide

US Patent & Trademark Office

(coherency directory) and (owner or ownership) and tag and o



## THE ACM DIGITAL LIBRAR

Feedback Report a problem Satisfaction survey

Terms used

coherency directory and owner or ownership and tag and owned and global

Found 2.797 of 138.517

Sort results bv

relevance

Save results to a Binder Search Tips

Try an Advanced Search Try this search in The ACM Guide

Display results

expanded form

Open results in a new window

Results 1 - 20 of 200

Result page: 1 2 3 4 5 6 7 8 9 10

Best 200 shown

Relevance scale

Adjustable block size coherent caches

Czarek Dubnicki, Thomas J. LeBlanc

ACM SIGARCH Computer Architecture News, Proceedings of the 19th annual international symposium on Computer architecture, Volume 20 Issue 2

Full text available: pdf(1.24 MB)

Additional Information: full citation, abstract, references, citings, index terms

Several studies have shown that the performance of coherent caches depends on the relationship between the granularity of sharing and locality exhibited by the program and the cache block size. Large cache blocks exploit processor and spatial locality, but may cause unnecessary cache invalidations due to false sharing. Small cache blocks can reduce the number of cache invalidations, but increase the nuber of bus or network transactions required to load data into the cache. In this paper we ...

<sup>2</sup> Cache memory performance in a unix enviroment

Cedell Alexander, William Keshlear, Furrokh Cooper, Faye Briggs June 1986 ACM SIGARCH Computer Architecture News, Volume 14 Issue 3

Full text available: pdf(2,10 MB)

Additional Information: full citation, citings, index terms

Memory coherence in shared virtual memory systems

Kai Li, Paul Hudak

November 1989 ACM Transactions on Computer Systems (TOCS), Volume 7 Issue 4

Full text available: pdf(2,71 MB)

Additional Information: full citation, abstract, references, citings, index terms, review

The memory coherence problem in designing and implementing a shared virtual memory on loosely coupled multiprocessors is studied in depth. Two classes of algorithms, centralized and distributed, for solving the problem are presented. A prototype shared virtual memory on an Apollo ring based on these algorithms has been implemented. Both theoretical and practical results show that the memory coherence problem can indeed be solved efficiently on a loosely coupled multiprocessor.

<sup>4</sup> Delayed consistency and its effects on the miss rate of parallel programs Michel Dubois, Jin Chin Wang, Luiz A. Barroso, Kangwoo Lee, Yung-Syau Chen August 1991 Proceedings of the 1991 ACM/IEEE conference on Supercomputing

Full text available: pdf(1.01 MB)

Additional Information: full citation, references, citings, index terms

A class of compatible cache consistency protocols and their support by the IEEE futurebus P. Sweazey, A. J. Smith

June 1986 ACM SIGARCH Computer Architecture News, Proceedings of the 13th annual international symposium on Computer architecture, Volume 14 Issue 2

Full text available: pdf(1.05 MB)

Additional Information: full citation, abstract, references, citings, index terms

Standardization of a high performance blackplane bus, so that it can accommodate boards developed by different vendors, implies the need for a standardized cache consistency protocol. In this paper we define a class of compatible consistency protocols supported by the current IEEE Futurebus design. We refer to this class as the MOESI class of protocols; the term "MOESI" is derived from the names of the states. This class of protocols has the property that any system component ca ...

Multi-level shared caching techniques for scalability in VMP-M/C D. R. Cheriton, H. A. Goosen, P. D. Boyle April 1989 ACM SIGARCH Computer Architecture News , Proceedings of the 16th annual international symposium on Computer architecture, Volume 17 Issue 3 Full text available: pdf(1,27 MB) Additional Information: full citation, abstract, references, citings, index terms The problem of building a scalable shared memory multiprocessor can be reduced to that of building a scalable memory hierarchy, assuming interprocessor communication is handled by the memory system. In this paper, we describe the VMP-MC design, a distributed parallel multi-computer based on the VMP multiprocessor design, that is intended to provide a set of building blocks for configuring machines from one to several thousand processors. VMP-MC uses a memory hierarchy based on shared caches ... Cache coherence protocols: evaluation using a multiprocessor simulation model James Archibald, Jean-Loup Baer September 1986 ACM Transactions on Computer Systems (TOCS), Volume 4 Issue 4 Full text available: pdf(1.79 MB) Additional Information: full citation, abstract, references, citings, index terms, review Using simulation, we examine the efficiency of several distributed, hardware-based solutions to the cache coherence problem in shared-bus multiprocessors. For each of the approaches, the associated protocol is outlined. The simulation model is described, and results from that model are presented. The magnitude of the potential performance difference between the various approaches indicates that the choice of coherence solution is very important in the design of an efficient shared-bus multi ... A cache consistency protocol for multiprocessors with multistage networks P. Stenström April 1989 ACM SIGARCH Computer Architecture News, Proceedings of the 16th annual international symposium on Computer architecture, Volume 17 Issue 3 Additional Information: full citation, abstract, references, citings, index terms A hardware based cache consistency protocol for multiprocessors with multistage networks is proposed. Consistency traffic is restricted to the set of caches which have a copy of a shared block. State information is distributed to the caches and the memory modules need not be consulted for consistency actions. The protocol provides two operating modes: distributed write and global read. Distribution of writes calls for efficient multicast methods. Communication cost for multicasti ... A cache coherence approach for large multiprocessor systems J. K. Archibald June 1988 Proceedings of the 2nd international conference on Supercomputing Additional Information: full citation, abstract, references, citings, index terms Full text available: pdf(1.05 MB) This paper explores the architecture of high-performance large scale multiprocessors using private caches for each processor. The caches reduce the average memory access time, but they also result in the well known cache coherence problem. Multiple copies of each memory location are allowed to exist but they must be kept consistent with each other. In this paper, we present a solution to the cache coherence problem specifically for shared bus multiprocessors that adapts dyn ... <sup>10</sup> The sun fireplane system interconnect Alan Charlesworth November 2001 Proceedings of the 2001 ACM/IEEE conference on Supercomputing (CDROM) Additional Information: full citation, abstract, references, citings, index terms Full text available: pdf(224.87 KB) System interconnect is a key determiner of the cost, performance, and reliability of large cachecoherent, shared-memory multiprocessors. Interconnect implementations have to accommodate ever greater numbers of ever faster processors. This paper describes the Sun™ Fireplane two-level cachecoherency protocol, and its use in the medium and large-sized UltraSPARC-III-based Sun Fire™ servers. <sup>11</sup> Architecture and design of AlphaServer GS320 Kourosh Gharachorloo, Madhu Sharma, Simon Steely, Stephen Van Doren November 2000 Proceedings of the ninth international conference on Architectural support for programming languages and operating systems, Volume 28, 34 Issue 5, 5 Additional Information: full citation, abstract, references, citings, index terms Full text available: pdf(413.91 KB) This paper describes the architecture and implementation of the AlphaServer GS320, a cachecoherent non-uniform memory access multiprocessor developed at Compaq. The AlphaServer GS320 architecture is specifically targeted at medium-scale multiprocessing with 32 to 64 processors. Each

node in the design consists of four Alpha 21264 processors, up to 32GB of coherent memory, and an

aggressive IO subsystem. The current implementation supports up to 8 such nodes for a total of 32 processors. While s ... <sup>12</sup> Architecture and design of AlphaServer GS320 Kourosh Gharachorloo, Madhu Sharma, Simon Steely, Stephen Van Doren November 2000 ACM SIGPLAN Notices, Volume 35 Issue 11 Additional Information: full citation, abstract, references, citings, index terms Full text available: pdf(1.67 MB) This paper describes the architecture and implementation of the AlphaServer GS320, a cachecoherent non-uniform memory access multiprocessor developed at Compaq. The AlphaServer GS320 architecture is specifically targeted at medium-scale multiprocessing with 32 to 64 processors. Each node in the design consists of four Alpha 21264 processors, up to 32GB of coherent memory, and an aggressive IO subsystem. The current implementation supports up to 8 such nodes for a total of 32 processors. While s ... <sup>13</sup> Implementing a cache consistency protocol R. H. Katz, S. J. Eggers, D. A. Wood, C. L. Perkins, R. G. Sheldon ACM SIGARCH Computer Architecture News, Proceedings of the 12th annual international symposium on Computer architecture, Volume 13 Issue 3 Additional Information: full citation, citings, index terms Full text available: pdf(803.11 KB) Keywords: ownership-based protocols, shared bus multicomprocessor cache consistency, single chip implementation, snooping caches Sensor databases: Cache-and-query for wide area sensor databases Amol Deshpande, Suman Nath, Phillip B. Gibbons, Srinivasan Seshan June 2003 Proceedings of the 2003 ACM SIGMOD international conference on on Management of data Additional Information: full citation, abstract, references, index terms Full text available: pdf(230,75 KB) Webcams, microphones, pressure gauges and other sensors provide exciting new opportunities for querying and monitoring the physical world. In this paper we focus on querying wide area sensor databases, containing (XML) data derived from sensors spread over tens to thousands of miles. We present the first scalable system for executing XPATH queries on such databases. The system maintains the logical view of the data as a single XML document, while physically the data is fragmented across a ... <sup>15</sup> Piranha: a scalable architecture based on single-chip multiprocessing Luiz André Barroso, Kourosh Gharachorloo, Robert McNamara, Andreas Nowatzyk, Shaz Qadeer, Barton Sano, Scott Smith, Robert Stets, Ben Verghese May 2000 ACM SIGARCH Computer Architecture News, Proceedings of the 27th annual international symposium on Computer architecture, Volume 28 Issue 2 Additional Information: full citation, abstract, references, citings, index terms Full text available: pdf(191.10 KB) The microprocessor industry is currently struggling with higher development costs and longer design times that arise from exceedingly complex processors that are pushing the limits of instruction-level parallelism. Meanwhile, such designs are especially ill suited for important commercial applications, such as on-line transaction processing (OLTP), which suffer from large memory stall times and exhibit little instruction-level parallelism. Given that commercial applications constitute by fa ...

A distributed shared memory multiprocessor ASURA: memory and cache architecture S. Mori, H. Saito, M. Goshima, S. Tomita, M. Yanagihara, T. Tanaka, D. Fraser, K. Joe, H. Nitta December 1993 Proceedings of the 1993 ACM/IEEE conference on Supercomputing

Full text available: pdf(1.17\_MB)

Additional Information: full citation, references, citings, index terms

<sup>17</sup> An empirical evaluation of two memory-efficient directory methods Brian W. O'Krafka, A. Richard Newton

ACM SIGARCH Computer Architecture News, Proceedings of the 17th annual international symposium on Computer Architecture, Volume 18 Issue 3

Full text available: pdf(1.19 MB)

Additional Information: full citation, abstract, references, citings, index terms

This paper presents an empirical evaluation of two memory-efficient directory methods for maintaining coherent caches in large shared memory multiprocessors. Both directory methods are modifications of a scheme proposed by Censier and Feautrier [5] that does not rely on a specific

interconnection network and can be readily distributed across interleaved main memory. The schemes considered here overcome the large amount of memory required for tags in the original scheme in two different ways ...

<sup>18</sup> Pre-silicon verification of the Alpha 21364 microprocessor error handling system Richard Lee, Benjamin Tsien

June 2001 Proceedings of the 38th conference on Design automation

Full text available: ndf(176.87 KB)

Additional Information: full citation, abstract, references, index terms

This paper presents the strategy used to verify the error logic in the Alpha 21364 microprocessor. Traditional pre-silicon strategies of focused testing or unit-level random testing yield limited results in finding complex bugs in the error handling logic of a microprocessor. This paper introduces a technique to simulate error conditions and their recovery in a global environment using random test stimulus closely approximating traffic found in a real system. A significant number of bugs ...

19 Parallel architectures: Inferential queueing and speculative push for reducing critical communication latencies

Ravi Rajwar, Alain Kägi, James R. Goodman

Proceedings of the 17th annual international conference on Supercomputing June 2003

Full text available: pdf(568,93 KB)

Additional Information: full citation, abstract, references, index terms

Communication latencies within critical sections constitute a major bottleneck in some classes of emerging parallel workloads. In this paper, we argue for the use of Inferentially Queued Locks (IQLs) [31], not just for efficient synchronization but also for reducing communication latencies, and we propose a novel mechanism, Speculative Push (SP), aimed at reducing these communication latencies. With IQLs, the processor infers the existence, and limits, of a critical section from the use of synch ...

Keywords: data forwarding, inferential queueing, synchronization

Options for dynamic address translation in COMAs

Xiaogang Qiu, Michel Dubois

**April 1998** ACM SIGARCH Computer Architecture News, Proceedings of the 25th annual

international symposium on Computer architecture, Volume 26 Issue 3

Full text available

Publisher Additional Information: full citation, abstract, references, citings, index terms

In modern processors, the dynamic translation of virtual addresses to support virtual memory is done before or in parallel with the first-level cache access. As processor technology improves at a rapid pace and the working sets of new applications grow insatiably the latency and bandwidth demands on the TLB (Translation Lookaside Buffer) are getting more and more difficult to meet. The situation is worse in multiprocessor systems, which run larger applications and are plagued by the TLB consiste ...

Results 1 - 20 of 200

Result page: 1 2 3 4 5 6 7 8 9 10

The ACM Portal is published by the Association for Computing Machinery. Copyright © 2004 ACM, Inc. Terms of Usage Privacy Policy Code of Ethics Contact Us

Useful downloads: Adobe Acrobat QuickTime Windows Media Player



"coherency directory" or "coherence directory" or "global direct



### **Nothing Found**

Your search for "coherency directory" or "coherence directory" or "global directory" did not return any results.

You may want to try an Advanced Search for additional options.

Please review the Quick Tips below or for more information see the Search Tips.

#### **Quick Tips**

• Enter your search terms in lower case with a space between the terms.

sales offices

You can also enter a full question or concept in plain language.

Where are the sales offices?

Capitalize proper nouns to search for specific people, places, or products.

John Colter, Netscape Navigator

• Enclose a phrase in double quotes to search for that exact phrase.

"museum of natural history" "museum of modern art"

Narrow your searches by using a + if a search term <u>must appear</u> on a page.

museum +art

• Exclude pages by using a = if a search term <u>must not appear</u> on a page.

museum -Paris

Combine these techniques to create a specific search query. The better your description of the information you want, the more relevant your results will be.

museum +"natural history" dinosaur -Chicago

The ACM Portal is published by the Association for Computing Machinery. Copyright © 2004 ACM, Inc. Terms of Usage Privacy Policy Code of Ethics Contact Us

Useful downloads: Adobe Acrobat

Q QuickTime
Windows Media Player

Real Player